LitLin 18_4 361-378 fqh002 FIN
نویسندگان
چکیده
This paper presents the newly released Lancaster Corpus of Mandarin Chinese (LCMC), a Chinese match for the FLOB and Frown corpora of British and American English. We first discuss the major decisions we took when building the corpus. These relate to sampling, text collection, mark-up, and annotation. Following from this we use the corpus to study aspect marking in Chinese and British/American English. The study shows that although Chinese and English are typologically different, aspect markers in the two languages show a strikingly similar distribution pattern, especially across the two broad categories of narrative and expository texts. The study also reveals some important differences in the distribution of aspect markers in Chinese versus English and British versus American English across fifteen text categories, and provides an account of these differences. Correspondence:
منابع مشابه
LitLin 18_4 423-447 fqh009 FIN
Large, real world, data sets have been investigated in the context of Authorship Attribution of real world documents. Ngram measures can be used to accurately assign authorship for long documents such as novels. A number of 5 (authors) 5 (movies) arrays of movie reviews were acquired from the Internet Movie Database. Both ngram and naive Bayes classifiers were used to classify along both the au...
متن کاملLitLin 19_4 453-475 fqh034 FIN
Delta, a simple measure of the difference between two texts, has been proposed by John F. Burrows as a tool in authorship attribution problems, particularly in large ‘open’ problems in which conventional methods of attribution are not able to limit the claimants effectively. This paper tests Delta’s effectiveness and accuracy, and shows that it works nearly as well on prose as it does on poetry...
متن کاملNICHOLAS KAHN - FOGEL Western Universalism and African
........................................................................................... 315 Introduction ...................................................................................... 317 I. Law and Contemporary Attitudes Toward African Homosexuality ....................................................................... 323 II. Liberal Philosophy and Liberal Legal Frameworks .........
متن کاملDistribution of histamine in human blood.
Introduction ............................................................. 361 h~ethodology ............................................................ 363 Relative specificities of methods. ......................................... 365 Basophil Leukocytes. ..................................................... 366 Eosinophil leukocyte in relation to basophil leukocyte. ....................... ...
متن کاملSimultaneous Determination of Amlodipine Besylate and Atorvastatin Calcium in Binary Mixture by Spectrofluorimetry and HPLC Coupled with Fluorescence Detection
Caduet tablets are novel prescription drug that combines amlodipine besylate (AM) with atorvastatin calcium (AT). A spectrofluorimetric and an HPLC-fluorescence detection methods were developed for simultaneous determination of both drugs in tablets. In the spectrofluorimetric method, native fluorescence of AM and AT were measured in methanol at 442 and 369 nm upon excitation at 361 and 274 nm,...
متن کامل